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INTELLIGENT DATA MANAGEMENT 


WILLIAM J. CAMPBELL 
SPACE DATA AND COMPUTING DIVISION 
DATA MANAGEMENT SYSTEM FACILITY 
NASA/GSFC 


Abstract 

Intelligent data management is the concept of interfacing a 

USER TO A DATABASE MANAGEMENT SYSTEM WITH A VALUE ADDED 
SERVICE THAT WILL ALLOW A FULL RANGE OF DATA MANAGEMENT 
OPERATIONS AT A HIGH LEVEL OF ABSTRACTION USING HUMAN WRITTEN 
LANGUAGE. THE DEVELOPMENT OF SUCH A SYSTEM WILL BE BASED ON 
EXPERT SYSTEMS AND RELATED ARTIFICIAL INTELLIGENCE TECHNOLO- 
GIES., AND WILL ALLOW THE CAPTURING OF PROCEDURAL AND RELA- 
TIONAL KNOWLEDGE ABOUT DATA MANAGEMENT OPERATIONS AND THE 
SUPPORT OF THE USER WITH SUCH KNOWLEDGE IN AN ON-LINE, 
INTERACTIVE MANNER. SUCH A SYSTEM WILL HAVE THE FOLLOWING 
CAPABILITIES: 

i AN UNDERSTANDING BETWEEN SCIENCE APPLICATIONS AND 
STORED DATA 

• The ability to construct a model of the users view of 

THE DATABASE, BASED ON THE QUERY SYNTAX 

• The ability to transform English queries and commands 

INTO DATABASE INSTRUCTIONS AND PROCESSES 

• The use of heuristic knowledge to rapidly prune the 

DATA SPACE IN SEARCH PROCESSES 

• An on-line explanation system that will allow the user 

TO UNDERSTAND WHAT THE SYSTEM IS DOING AND WHY IT IS 
DOING IT 

Such a system will be goal oriented rather than procedure 

ORIENTED SUPPORTING VARYING LEVELS OF ABSTRACTION OF THE DATA 
OF INTEREST TO THE USER. 
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THE INFORMATION EXPLOSION 
PROBLEM/SOLUTION 
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THE APPLICATION OF ARTIFICIAL INTELLIGENCE ( A I ) TO 
EARTH SCIENCE INFORMATION MANAGEMENT OFFERS THE ONLY 
REAL POSSIBILITY OF REVERSING THIS TREND 




-J 


_j 

< 




< 

1— 

< 

h- 





< 

oc 

< 




o 

z 

z 

Q 





H— 

Q 





H- 


UJ 

LU 




C 

Z 

CD 

O 




QC 

UJ 

o 





UJ 

1 — 

QC 

< 




Q- 

CO 

a. 

LU 


i— 1 


O 

>- 


QC 


<c 



CO 

o 

< 




UJ 


z 





z 

<c 

1—4 

UJ 

z 

LU 


h— 


h- 

z 

CD 

CD 

• • 


cd 

oc 

h- 

a; 

Z 

co 

CO 

z 

o 


< 

UJ 

< 

o 

K— « 

Q- 

z 

LU 

o 


z 

Q 

Q- 

1— 1 

CO 

1— « 

1— 1 

UJ 

>-~4 

Z 


LU 

— I 

< 

1— 

> 

CO 

CO 

oc 

_J 


X 

o 




UJ 

UJ 

Ul 

o£ 

QC 

CO 

LU 

h- 

z 


a. 

o 

< 

CD 

3 

1— » 

h- 


Lu 

H- 

Z 

t— 1 

Lu 

< 

>- 



LU 


UJ 

z 

CQ 

LU 

z 

►— i 

— 1 

Q 

1 — 


CD 

CD 

CD 

< 



oc 

a 

oc 

CO 

H- • 

UJ 

UJ 

UJ 

LU 

< 


o 

3 

o 

CO 

-J 

LU 

z 

1— « 


1— 1 

z 

3 

CO 

h— 

lu 


> 


O 

LU 

QC 

1— I 

3 

OC 

< 

z 

QC 

< 

h— 

UJ 

UJ 


X 


LU 

oc 


co 

Lu 


Q 


< 

> 


o 


Z 

cn 



Q 


< 

C 

o 

to 

UJ 

UJ 

to 

z 


Lu 

1-^ 

cd 

Q 

LU 

z 

-J 



z 

Q 

►— « 

z 

< 

h- 

1— 

UJ 

C 

1— 



z 

< 

I— 1 


»— ■* 

CO 

o 

LU 

ic 

cd 

UJ 

— 1 

LU 

»— 1 

Z 

3 

oo 

ZD 

»— 4 

DC 

I— 

LU 



—J 

CO 

Z 

<c 

CD 


z 

< 

c 

H- 

oc 

c 


H— 


a_ 

a. 

LU 

z 


QC 


< 

< 

a. 

< 


< 

< 

cd 

CD 

o 

z 


UJ 







UJ 







Z 







h- 

O 






z 







o 







or 







1-82 


HOW INTELLIGENT DATA MANAGEMENT CAN SUPPORT EARTH SCIENCE 
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THE CAPTURING OF HUMAN KNOWLEDGE INTO SYSTEMS THAT 
CAN BE USED BY NON-EXPERTS 
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WOULD USE HEURISTIC SEARCH PROCESSES THAT LEAD TO SHORTCUTS 
BY EARLY PRUNING OF LARGE PORTIONS OF UNWANTED DATA SPACE 


INTELLIGENT DATA MANAGEMENT 
"ABSTRACTION PYRAMID" 
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IMAGE FEATURE DETECTION 

FEATURE AND EXTRACTION 
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THE RELATIONAL APPROACH 


DATABASES 


TAPEINFO 

NOTAPE 

TAPETYPE 

TITLE1 

TITLED 

TITLE3 

1003 

SDT 


BY... 

DATA . . . 

1004 

SDT 

NIMBUS6. .. 

BY... 

DATA... 


FILEINFO 

PB 

FILE 

NOTAPE 

17400.0 

2 

1003 

17401.0 

3 

1003 

17457.0 

2 

1004 

17459.0 

3 

1004 


RECINFO 

DATE 

TIME 

LON 

LAT 



PB 





SCAN 

790103 

124549 

158.57 

-1.00 


gum 

17400.0 

0 

ON 

NIGHT 

NO 

OFF 

790105 

145629 

25.90 

-0.96 


n 

17459.0 

0 

ON 

NIGHT 

NO 

OFF 

790117 

110204 

2.75 

10.43 

1105.00 

34.11 

17457.0 

0 

ON 

NIGHT 

NO 

OFF 



4.82 

10.84 




8 

OFF 

DAY 

YES 

ON 


QUERIES ON DATABASES 


select (t.tapetype, t.titlel, t.title2, t.title3) as result 
from tapeinfo t, 
fileinfo f, 
recinfo r 

where t.notape = f.notape 
and f.pb = r.pb 
and r.zen s 153.63 
and r.calib = ’NO’ 
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NATURAL LANGUAGE INTERACTION 
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DATA MANAGEMENT CONTROLLER 
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FIGURE 4. QUADTREE STRUCTURE (FROM HUNTER AND STEIGUTZ, 1979) 
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SELECT DATA SETS FROM LANDS AT CREATED FOR USER 

FROM THE TM SENSOR. IN BAND 2. 3. 4 
FOR QUARTERS 1, 2, 3, 4 FOR YEARS 
1983, 84, 85 FOR LAND AREA LAT/LONG 


IF QUERY SUPPORTS SOIL EROSION ANALYSIS THEN THE TYPE OF SENSOR 
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INTELLIGENT DATA MANAGEMENT CONCEPT 
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Software Management Environment 


Frank McGarry - Goddard Space Flight Center 


There exists an overall RTOP effort, supported by Code R, which is 
attempting to define, assess and integrate software measures and tools 
into an environment that will aid the management process for software 
development. During this briefing, examples of three specific areas of 
work are discussed. These areas are: 


1. Research into the development of Software Design measures 

2. Research into the development of Software Specification measures 

3. Attempts at integrating identified measures and models into a 
'Dynamic Management Information Tool 1 (DYNAMITE) 


A. Design Measures 


By closely studying and monitoring numerous software development projects 
at Goddard, several promising approaches to assessing design charac- 
teristics have been developed. One is a mechanism by which the classical 
measures of 'strength' and 'coupling' are extracted at design time and 
are used to predict reliability and overall quality of the software 
product. Early indications imply a high correlation with overall 
reliability and productivity of the end product. 


A second design measure being investigated is one that tracks the 
evolution of the general architecture of a software system during the 
design process. Early results imply that projects with apparent 
disparity between the evolution of 'data structure' and 'control 
structure' will be more likely to have reliability problems later. 

B. Specifications Measures 


Attempts were carried out at developing measures by which software 
specifications could be used to predict complexity, reliability and 
other characteristics of the software product. One approach was 
utilized where objective measures (or counts) of such items as number 
of external file requirements, size of specifications, number of 
processes, etc. resulted in a dead end; no reliable measures were 
identified. 


A second approach, called the Composite Specification Model (CSM), was 
applied to an ongoing project at Goddard and the early indicators show 
promise for this approach as a means for truly analyzing software 
complexity during the requirements phase. 

C. Dynamic Management Information Tool (DYNAMITE) 

A software tool has been designed which attempts to support the user in 
assessing software quality as well as predicting future events such as 
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schedules and cost. The tool has at its disposal a set of historical 
data on completed projects, a set of measures and models developed by 
this research effort, as well as an 'expert system' which contains a 
set of software development 'rules', also developed by this and other 
research projects. 

This tool exists in the prototype stage and currently contains about 100 
rules. It utilizes the KMS inference engine and currently has the 
capability of performing some very basic predictions and assessments of 
active projects where some very basic development information is made 
known to it. 


